Comparison of chloroplast genomes and phylogenomics in the Ficus sarmentosa complex (Moraceae)

Due to maternal inheritance and minimal rearrangement, the chloroplast genome is an important genetic resource for evolutionary studies. However, the evolutionary dynamics and phylogenetic performance of chloroplast genomes in closely related species are poorly characterized, particularly in taxonomically complex and species-rich groups. The taxonomically unresolved Ficus sarmentosa species complex (Moraceae) comprises approximately 20 taxa with unclear genetic background. In this study, we explored the evolutionary dynamics, hotspot loci, and phylogenetic performance of thirteen chloroplast genomes (including eleven newly obtained and two downloaded from NCBI) representing the F. sarmentosa complex. Their sequence lengths, IR boundaries, repeat sequences, and codon usage were compared. Both sequence length and IR boundaries were found to be highly conserved. All four categories of long repeat sequences were found across all 13 chloroplast genomes, with palindromic and forward sequences being the most common. The number of simple sequence repeat (SSR) loci varied from 175 (F. dinganensis and F. howii) to 190 (F. polynervis), with the dinucleotide motif appearing the most frequently. Relative synonymous codon usage (RSCU) analysis indicated that codons ending with A/T were prior to those ending with C/T. The majority of coding sequence regions were found to have undergone negative selection with the exception of ten genes (accD, clpP, ndhK, rbcL, rpl20, rpl22, rpl23, rpoC1, rps15, and rps4) which exhibited potential positive selective signatures. Five hypervariable genic regions (rps15, ycf1, rpoA, ndhF, and rpl22) and five hypervariable intergenic regions (trnH-GUG-psbA, rpl32-trnL-UAG, psbZ-trnG-GCC, trnK-UUU-rps16 and ndhF-rpl32) were identified. Overall, phylogenomic analysis based on 123 Ficus chloroplast genomes showed promise for studying the evolutionary relationships in Ficus, despite cyto-nuclear discordance. Furthermore, based on the phylogenetic performance of the F. sarmentosa complex and F. auriculata complex, the chloroplast genome also exhibited a promising phylogenetic resolution in closely related species.


Introduction
The genus Ficus L. (Moraceae) is a species-rich taxon which contains at least 800 species and is widely distributed across tropical and subtropical regions [1][2][3]. Due to insufficient genetic differentiation, the genus Ficus is taxonomically complex and contains many sympatric species, including the F. pedunculosa group, F. punctata group, F. chartacea group, and F. subulata group, among others [1,3,4]. Meanwhile, widely not-strict one-to-one obligate mutualism between fig trees and fig wasps has resulted in frequent hybridization and introgression among Ficus species [5][6][7][8], which has so far hindered research on the taxonomy and evolutionary history [9]. Our current understanding of the Ficus phylogenetic framework is the result of research on a few nuclear loci, such as ITS, ETS, G3pdh, GBSSI, and waxy [3,4,[10][11][12][13][14]. Although nuclear genome data have been used to reconstruct the Ficus phylogeny [15][16][17], these studies represent less than ten percent of Ficus species, which is unlikely to accurately represent the evolutionary relationship in the genus Ficus.
While the use of nuclear genome data has advantages for the detection of hybridization and introgression, organellar genomic resources are also of great importance to evolutionary research due to maternal inheritance as a single unit [18,19]. In the last decade, due to the rapidly decreasing cost of whole-genome sequencing (WGS) and the development of chloroplast genomic assembling pipelines, such as GerOrganelle [20], Fast-Plast (https://github.com/ mrmckain/Fast-Plast), NOVOPlasty [21], and ORG.asm (https://git.metabarcoding.org/orgasm/org-asm), plastome-based evolutionary research has become easier and more cost-effective [22][23][24]. However, chloroplast genomes are publicly available for less than five percent of species in Ficus [25]. Bruun-Lund et al. [26] published a novel and innovative Ficus phylogenetic framework based on 59 newly obtained chloroplast genomes, and the results of which were obviously inconsistent with phylogenies based on the nuclear genome. Unfortunately, many of the genome sequences used by Bruun-Lund et al. contained gaps (an average of 7% missing data with a maximum of 65%), leading to difficulty in comparative chloroplast genomics and possible phylogenetic artifacts [27]. Even so, the chloroplast framework of Brunn-Lund et al. has been replicated by subsequent research generally with an extended dataset [17].
Overall, according to studies in both Ficus and the other taxa [28][29][30], it has been verified to be effective for chloroplast genomes to reconstruct the phylogenetic reference at infra-, intergeneric, and higher ranks. However, little is known about the phylogenetic performance of complete chloroplast genomes in studies of closely related species or even infra-species, particularly in the taxonomically complex genera such as Ficus.
The Ficus subg. Synoecia sect. Rhizocladus subsect. Plagiostigma comprises approximately 10 species and 11 varieties which are widely distributed across east Asia, parapatric to the distribution center of Ficus southeast Asia [1,3,13,16,31]. Subsect. Plagiostigma is genetically unclear and taxonomically unresolved, forming the Ficus sarmentosa species complex [4]. After the first systematic treatment of the F. sarmentosa complex by Corner in 1960 and1965 [32, 33], few studies have attempted to unravel the taxonomic complexity of this group with the exception of descriptions of several controversial taxa in the 1980s [1,[34][35][36][37] (such as F. dinganensis, F. guizhouensis, and F. polynervis) and the rank elevation of some varieties (such as F. pubigera var. anserina and F. sarmentosa var. thunbergii) [38][39][40]. More recently, some of this phylogenetic ambiguity was resolved through the molecular work of Zhang et al. [4]. Zhang's work resulted in the rank elevation of F. pubigera var. anserina and the discovery that F. sarmentosa is not monophyletic, complicating the relationship between the complex and these previously described species in the 1980s. However, because only three loci (ITS, ETS, and G3pdh) were used to resolve the genetic background of the complex, the results were neither stable nor highly resolved [4]. The inclusion of more variable genetic loci or genome data should be helpful to resolve the taxonomic uncertainty of the F. sarmentosa complex. In this study, we utilized both comparative chloroplast genomics and phylogenomics to characterize 1) the diversity of hotspot loci, 2) the variation among chloroplast genomes, and 3) the potential of chloroplast genomes to resolve the evolutionary relationships between closely related species of the F. sarmentosa species complex.

Materials and methods
Sample collection, DNA extraction and resequencing, and genome assembly and annotation Healthy, young leaves were collected from the field in 2015-2021, and each was sealed in silica gel. Based on the phylogenetic relationships outlined in Zhang et al. [4], we sampled eleven taxa within the F. sarmentosa species complex in order to maximize genetic coverage. Detailed sample information is shown in S1 Table. All voucher specimens were stored at the herbarium of East China Normal University (HSNU).
Total DNA was extracted from 100 mg of dry leaf tissue using the CTAB method [41]. After quality detection with NanoDrop and Qubit 2.0, purified DNA samples were randomly ultrasonicated into~350 bp segments, which were subsequently used to construct paired-end libraries. Whole-genome resequencing (WGS) was carried out using the Illumina NovaSeq 6000 platform, according to the PE150 sequencing strategy. Raw reads were filtered and cleaned according to the following criteria: reads containing > 10% unidentified nucleotides, > 50% low-quality bases (Q�5), or adapter sequences were omitted for further analyses. Finally, the chloroplast genome sequences of two more taxa in the F. sarmentosa complex, F. sarmentosa var. henryi (GenBank accession no. OL415083) and F. dinganensis (GenBank accession no. OK375500), were included for further comparative analyses.
Clean data were used to assemble the complete circular chloroplast genome with GetOrganelle v1.6.4 [20], utilizing the "embplant_pt" model. In the case that the output sequence was not circular, the R (number of runs) and w (word size) parameters were adjusted until circularity was achieved. Chloroplast genome annotation was carried out using PGA [42], with the default parameters. The chloroplast genome was visualized using the online OGDRAW tool (https://chlorobox.mpimp-golm.mpg.de/OGDraw.html) [43]. For consistency, the two supplementary plastome sequences (OL415083 and OK375500) were re-annotated according to the same routine.
One of the inverted repeat regions (IRb) was removed in the REPuter analysis to avoid repeatable results. Simple sequence repeats (SSRs) were detected using MISA (https://webblast.ipkgatersleben.de/misa/) [46], with the following parameters: 8 repeat units for mononucleotide SSRs, 5 repeat units for dinucleotide SSRs, 4 repeat units for trinucleotide SSRs, and 3 repeat units for tetra-, penta-, and hexanucleotide SSRs. The maximal sequence length between two SSRs was set to 100 bp.

Comparison of complete chloroplast genomes and diversity hotspot analysis
All 13 F. sarmentosa complex genomes were compared using the mVISTA online tool (https:// genome.lbl.gov/vista/mvista/submit.shtml) [47], with the global multiple alignment model (LAGAN). The F. anserine chloroplast genome was used as the reference and the RankVISTA probability threshold was set to 0.5. All genes and intergenic regions were extracted from the Genbank annotation files in batches using Perl scripts created by Xiao-Jian Qu (https://github. com/quxiaojian/Bioinformatic_Scripts). Alignments of the genic and intergenic loci were carried out using MUSCLE v5.1 [48], with default parameters. After alignment, the nucleotide diversity (π) was calculated for all genic and intergenic loci.

Codon usage analysis
To compare codon usage patterns of all the CDS sequences across 13 F. sarmentosa complex genomes, the relative synonymous codon usage (RSCU) was calculated using DAMBE v7.3.11 [49].

Selective pressure analysis
The ratio of nonsynonymous (Ka) to synonymous (Ks) substitution can be used to quantify evolutionary selective pressure. The Ka/Ks ratio was calculated for all 79 unique CDSs using TBtools v1.09876 [50]. Positive selection was indicated by Ka/Ks > 1, negative (purifying) selection was indicated by Ka/Ks < 1, and neutral selection was indicated by Ka/Ks = 1. Based on the phylogenetic framework outlined in Zhang et al. [4], F. simplicissima was chosen as the reference to calculate Ka and Ks between F. simplicissima and our sampled representatives of the F. sarmentosa complex. For visualization purposes, the NaN value (i.e., both Ks and Ka = 0) was manually set as "1" to denote neutral selection. Finally, the infinity value (Ks > 0 and Ka = 0) was counted alone.

Phylogenetic analysis
The 13 F. sarmentosa complex chloroplast genomes were combined with 36 Ficus chloroplast genomes from GenBank and 59 Ficus genomes published by Bruun-Lund et al. [26]. Additionally, 18 genomes from the China National GeneBank (accession number: CNP0001337) and Genome Sequence Archive (accession number: PRJCA002187) [15,51], including 8 samples belonging to the F. sarmentosa complex, were assembled to further explore the Ficus phylogenomics and the potential of chloroplast genomes to resolve the evolutionary relationships among the closely related species. Seven samples from the Olmedieae tribe were chosen as the outgroup, according to previous studies [26,52]. After discarding four samples found to be more than half (> 50%) missing data (three in the genus Ficus and one in the outgroup), a total of 129 chloroplast genomes were used to construct the phylogenetic tree (detailed samples information shown in S1 Table). Additionally, one of the two IRs was removed. The 129 genomes were aligned using MAFFT v7.490 [53], with the "auto" model. The aligned sequences were trimmed with trimAl and sites with > 10% gaps were removed [54]. The maximum likelihood (ML) tree was constructed using IQ-TREE 2 with ultrafast bootstrap (-bb) and aLRT test (-alrt) numbers set at 10000. The optimal nucleotide substitution model was chosen with ModelFinder [55].

IR contraction and expansion
Across all 13 F. sarmentosa complex genomes, four junctions among IR, LSC, and SSC regions were compared (Fig 2). Invariably and consistently across all 13 genomes, the boundary between the LSC and IRb regions (JLB) was located within the rps19 gene, 171 bp away from its starting base and 108 bp away from its ending base. The boundary between the IRb and SSC regions (JSB) was located within the ndhF gene, either 17 or 25 bp (F. pumila) away from its starting base and 2,236 (F. anserina, F. pubigera, F. sarmentosa var. henryi, and F. sarmentosa var. sarmentosa), 2,237 (F. pumila), or 2,245 bp (all other taxa) away from its ending base. The boundary between the SSC and IRa regions (JSA) was located within the ycf1 gene, either 4,713, 4,722, or 4,724 bp away from its starting base and 1,024 (F. pumila) or 1,026 bp away from its ending base. For the IRa and LSC regions, the junction between them was located

PLOS ONE
Comparative chloroplast genomics of the Ficus sarmetosa complex between the rpl2 and trnH genes, either 62 or 63 bp away from the starting base of the trnH gene.

Long repeat sequences and simple sequence repeats (SSRs)
Across all 13 F. sarmentosa complex genomes, a total of 373 long repeat sequences were identified, representing all four repeat categories: forward (F), reverse (R), complement (C), and palindromic (P) (Fig 3A). All four categories of repeats were detected within all 13 chloroplast genomes. Four taxa, F. anserina, F. dinganensis, F. polynervis, and F. sarmentosa var. sarmentosa, contained the greatest number of long repeat sequences (31), while F. sarmentosa var. thunbergii contained the least (26). Among all four repeats, P repeats were the most common across all 13 genomes, ranging from 14 (F. guizhouensis) to 16. There were relatively fewer R and C repeats, with F. polynervis, F. sarmentosa var. lacrymans, and F. sarmentosa var. sarmentosa containing the most (4). For the lengths of long repeat sequences, 30-39 bp is the most common ranging from 21 (F. sarmentosa var. thunbergii) to 26 (F. anserina and F. dinganensis) times ( Fig 3B). The long repeat sequences over 60 bp in length were the least, appearing only once. The maximum length among all the long repeat sequences was 64 bp.

Codon usage
Across all 13 F. sarmentosa complex genomes, the 79 unique protein-coding CDS regions were encoded by between 22,867 (F. polynervis and F. sarmentosa var. lacrymans) and 22,918 (F.  Table 2. Among these codons, CAU encoding histidine (H) was the most frequent, appearing 12,800 times across all 13 taxa. Except the stop codon, UGU encoding L-Cysteine (C) was the next rarest, appearing 718 times only. According to the RSCU analysis, GCU and CUU had the highest average values of 1.858 and 1.818, respectively (

Genomic divergence and hotspot regions
The divergence of whole sequence among the 13 F. sarmentosa complex genomes was analyzed using the mVISTA online platform with F. anserina as a reference. The results showed that the full-length chloroplast genomes were largely conserved across all 13 taxa. The majority of variable sites were located in intergenic spacer regions (marked red in Fig 5). Interestingly, IRs were found to be more conserved than either LSCs or SSCs. Nucleotide diversity (π) was calculated for each gene and intergenic region to evaluate genetic differentiation and detect hyper-variable segments. Among 60 genes > 200 bp in length, rps15, ycf1, rpoA, ndhF, and rpl22 exhibited the highest nucleotide diversity: 0.00535, 0.00462, 0.00456, 0.00427, and 0.00377, respectively (Fig 6). The alignment lengths of these five genes ranged from 282 to 5,748 bp. Seven genes (atpI, psbE, rpl33, rps18, psbH, rrn23, and psaC) were found to be identical, with a nucleotide diversity of zero. The average diversity of the 60 genes was 0.001676, while the average diversity of intergenic spacer regions was 0.00435, approximately 2.6 times that of the genes (Fig 6). Among the intergenic spacer regions, the highest nucleotide diversity was exhibited by trnH-GUG-psbA (0.01458), followed by rpl32-trnL-UAG (0.01225), psbZ-trnG-GCC (0.01148), trnK-UUU-rps16 (0.01144), and ndhF-rpl32 (0.01112). The alignment lengths of these regions ranged from 366 to 1,829 bp (Fig 6).

Selective pressure analysis
The ratio of nonsynonymous (Ka) to synonymous (Ks) substitutions was calculated to quantify the evolutionary selective pressure on the F. sarmentosa complex, with F. simplicissima used as the reference genome. Overall, Ka/Ks ratios of most genes were < 1 (Fig 7). Additionally, twenty genes were found to contain no substitutions, i.e., both Ka and Ks are zero (shown as "1" in Fig 7). However, five genes (accD, matK, ndhF, rpoA, and ycf1) had partial Ka/Ks ratios over 1, which are potential signals of positive selection. An infinite Ka/ Ks ratio (Ka > 0 and Ks = 0) existed in 27 genes (Fig 7). Notably, ten genes (accD, clpP,  ndhK, rbcL, rpl20, rpl22, rpl23, rpoC1, rps15, and rps4) possessed infinite Ka/Ks ratios in more than half of the taxa.

Phylogenetic analysis
The chloroplast phylogenomic ML tree illustrated a well-supported phylogenetic relationship in the genus Ficus (Fig 8). Subgenus Pharmacosycea sect. Pharmacosycea was strongly supported as a sister group to the rest of the genus Ficus (SH-aLRT = 100 and MLBS = 100) Additionally, a clade including subg. Urostigma sect. Galoglychia and Americana, subg. Sycomorus sect. Sycomorus, and a few species of subg. Pharmacosycea sect. Oreosycea (subser. Ablbipilae in Corner's system) (clade A) was found to be sister to the remainder of Ficus except sect. Pharmacosycea. Aside from these two clades, the remaining Ficus taxa formed three clades (clades B, C, and D in Fig 8), with unstable support (SH-aLRT = 64.9 and MLBS = 73). Clade B (SH-aLRT = 99.9 and MLBS = 100) was comprised of five different subgenera, while clade C (SH-aLRT = 100 and MLBS = 100) included all species of subg. Sycidium as well as members of each of the other five subgenera. Clade D (SH-aLRT = 100 and MLBS = 100) contained only two species in subg. Pharmacosycea sect. Oreosycea. On the whole, all six subgenera were found not to be monophyletic. With the inclusion of eleven additional samples, a phylogenomic analysis was carried out on 24 samples representing the 13 taxa in the F. sarmentosa complex (Fig 8). The results showed that the F. sarmentosa complex failed to form a monophyletic group. Except three samples which are unexpectedly embedded in a distinct clade with other members of subg. Synoecia (clade E), two distinct lineages were recognized, with the majority of nodes within these two lineages being well-supported (clades F and G). The six F. sarmentosa varieties were found to be scattered across clades E, F, and G, and only two individuals of F. sarmentosa var. lacrymans clustered together. The six F. pumila samples were also found not to be a monophyletic group embedded by F. sarmentosa var. thunbergii.

Discussion
The differentiation and diversity of the chloroplast genome in the F. sarmentosa complex To date, vast comparative chloroplast genomic studies have been conducted in a wide range of taxonomic levels, such as order (such as Dipsacales [56] and Saxifragales [57]), family (such as Orchidaceae [58] and Zingiberaceae [59]), and genus (such as Camellia [60], Lindera [61], Gossypium [62], and Ficus [17,51,63]). However, less research has focused on the comparative genomics of taxa undergoing recent speciation, such as the species complex [29]. Although the structural conservation of the chloroplast genome at low taxonomic levels is well-characterized [64][65][66][67], the detailed patterns of genomic differentiation and diversity among closely related species remain largely unknown. Therefore, comprehensive comparisons between closely related species are necessary to improve our understanding of the mechanisms, rates, or directionality of genome evolution during the early stages after speciation [29].
In this study, we investigated the evolutionary dynamics of thirteen high-quality chloroplast genomes from the F. sarmentosa complex. Overall, the lengths of both whole-genome and quadripartite regions were quite similar among taxa, with only 0.2288%, 0.3784%, 0.2188%, and 0.1236% variation among whole genomes, LSCs, SSCs, and IRs, respectively ( Table 1, Fig  1). Furthermore, the number, content, and orientation of annotated genes among all 13 genomes were identical. The IR boundaries (JLB, JSB, JSA, and JLA) were also relatively coincident among the 13 genomes, being located at the same loci with only slight variation in the distance to the starting or ending bases. For long repeat sequences, all four repeat units were shared among all 13 plastomes, and the number of repeat units and their proportions exhibited only slight differentiation. For example, the proportion of palindromic sequences ranged from 45.16 to 61.54% (Fig 3). Similarly, in SSR regions, the proportion of the mononucleotide repeat units ranged from 72.63 to 85.96% (Fig 4).
The high conservation exhibited across the F. sarmentosa complex is consistent with other studies on closely related taxa. For example, a study of 22 closely related Oryza species indicated that conservation was common at lower taxonomic levels [29]. Even in morphologically diverse shrub willows (Salix), such high conservation is still exhibited [68]. A study of four peanut varieties serves as a more extreme example, reporting perfectly identical IR boundary junction positions [69]. Although comparative chloroplast genomic studies have rarely focused on closely related species, the high conservation of chloroplast genome among closely related species is recognized based on our work and other related research.

Chloroplast genomic evolutionary hotspots
Although chloroplast genomes exhibit high conservation among closely related species, discrepancy and heterogeneity have also been widely observed across the whole genomes. Overall, the variable sites of the two single-copy regions (LSC and SSC) are more abundant than IR regions in both the genus Ficus ( Fig 5) and most other plant groups [56,70,71]. Chloroplast regions with different mutation rates are appropriate for a range of evolutionary research. In general, conservative coding genes are suitable for deep phylogenetic inferences at the familylevel [28,72,73] or higher [74][75][76], whereas highly variable regions are appropriate for studies of biogeography, species delimitation, population genetics, and phylogenetic reconstruction at lower infra-generic levels [77,78].

The plastome phylogeny of the genus Ficus and the phylogenetic performance in closely related species
Based on a compilation of nearly all available Ficus chloroplast genomes, we obtained a robust ML phylogenetic tree with the majority of nodes exhibiting high bootstrap and SH-aLRT values, particularly the deep nodes (Fig 8). Overall, our ML phylogenetic tree is largely consistent with previous research [17,26], including the systematic position of subg. Pharmacosycea sect. Pharmacosycea and the mysterious displacement of certain individuals (F. fulva, F. magnoliifolia, F. albipila, F. albert-smithii, and F. pumila). However, analysis of our extended dataset highlighted an increase in non-monophyletic groups, such as subg. Urostigma sect. Urostigma and Conosycea (Fig 8). Notably, there were numerous incongruences between the chloroplast cladogram and previously published nuclear trees. For example, the chloroplast-based tree failed to support either Ficus subg. Sycomorus or subg. Sycidium as monophyletic groups, whereas these groups are wellconfirmed in nuclear phylogenies [3,[12][13][14]81]. Subg. Synoecia was also divided into three different clades (Fig 8). Further research into the displaced species, such as F. fulva, F. ablert-smithii, F. magnoliilolia, and F. albipila, may reveal host shifts or nonspecific pollination between fig trees and fig wasps [26]. Considering that a stable and comprehensively-sampled nuclear phylogenomic framework for the genus Ficus is still lacking, precise identification of these hybridization events will require more robust nuclear genome data as well as data from the associated fig wasps.
Compared to the nuclear phylogeny [4], we discovered a disparate evolutionary relationship among taxa in the F. sarmentosa complex, including three distinct clades (Fig 8, clades E, F, and G). Three samples from the complex dispersed into the clade E mixing with F. sagittata and the members of subg. Synoecia sect. Apiosycea. Unless more data support hybridization, misidentification may be an alternative explanation. Neither geographic nor morphological traits could be detected to support the split of clades F and G. Hybridization between F. pumila and F. sarmentosa var. thunbergii might exist, considering that the latter embedded into the former within clade G. Although our current samples are insufficient to fully resolve the taxonomy of the F. sarmentosa complex, the phylogenetic resolution provided by chloroplast genomes in the complex appears to be promising, as almost all the nodes are strongly supported. The chloroplast genome has also shown high discriminability across the F. auriculata complex (Fig 8, blue labels) [82][83][84]. The relationships between four taxa (F. auriculata, F. oligodon, F. hainanensis, and F. beipeiensis) in the complex were well-resolved, while F. beipeiensis shared a distinct phylogenetic relationship with the climbing fig tree F. tikoua. Moreover, a new linage (F. northern) was identified based on chloroplast genomics, suggesting a promising segue into further exploration of the cryptic species (Fig 8, blue labels) [17]. Chloroplast genomes have been used to reconstruct high-resolution phylogenetic trees in other closely related species groups, such as peanut [85], rice [29], willow [68], and orchardgrass [86]. Taken together, the chloroplast genome appears to be a promising tool for exploring the evolutionary relationships between closely related species and even species complexes, although cyto-nuclear discordances often exist.

Conclusions
In this study, eleven F. sarmentosa complex chloroplast genomes were newly sequenced and characterized. Sequence lengths, IR boundaries, repeat sequences, and codon usage were compared among these eleven, and two previously-reported, chloroplast genomes, indicating that these parameters were highly conserved across taxa. However, heterogeneity was found in both nucleotide diversity and selective pressure among segments. We characterized ten evolutionary hotspot regions (rps15, ycf1, rpoA, ndhF, rpl22, trnH-GUG-psbA, rpl32-trnL-UAG, psbZ-trnG-GCC, trnK-UUU-rps16 and ndhF-rpl32). Phylogenomic analysis indicated that chloroplast genomes show promise for inferring the phylogenetic relationships between closely related groups, despite cyto-nuclear discordance.
Supporting information S1 Table. The collecting information of samples in the study. (XLSX) S2 Table. Detailed information for the high-throughput sequencing data in the study. (XLSX)